Abstract. Barndorff-Nielsen and Cox (1994, p. 319) modify an estimative 
prediction limit to obtain an improved prediction limit with better coverage prop- 
erties. Kabaila and Syuhada (2008) present a simulation-based approximation to 
this improved prediction limit, which avoids the extensive algebraic manipula- 
tions required for this modification. We present a modification of an estimative 
prediction interval, analogous to the Barndorff-Nielsen and Cox modification, to 
obtain an improved prediction interval with better coverage properties. We also 
present an analogue, for the prediction interval context, of this simulation-based 
approximation. The parameter estimator on which the estimative and improved 
prediction limits and intervals are based is assumed to have the same asymptotic 
distribution as the (conditional) maximum likelihood estimator. The improved 
prediction limit and interval depend on the asymptotic conditional bias of this 
estimator. This bias can be very sensitive to very small changes in the estima- 
tor. It may require considerable effort to find this bias. We show, however, that 
the improved prediction limit and interval have asymptotic efficiencies that are 
functionally independent of this bias. Thus, improved prediction limits and in- 
tervals obtained using the Barndorff-Nielsen and Cox type of methodology can 
conveniently be based on the (conditional) maximum likelihood estimator, whose 
asymptotic conditional bias is given by the formula of Vidoni (2004, p. 144). Also, 
improved prediction limits and intervals obtained using Kabaila and Syuhada type 
approximations have asymptotic efficiencies that are independent of the estimator 
on which these intervals are based. 

Keywords. Asymptotic efficiency; estimative prediction limit; improved 
prediction limit. 
1. INTRODUCTION 



Suppose that $\{Y_t\}$ is a discrete-time stochastic process with probability
distribution determined by the parameter vector $\theta$, where the $Y_t$ are
continuous random variables. Also suppose that $\{Y^{(t)}\}$ is a Markov process,
where $Y^{(t)} = (Y_{t-p+1}, \ldots, Y_t)$. For example, $\{Y_t\}$ may be an
AR($p$) process or an ARCH($p$) process. The available data is $Y_1, \ldots, Y_n$.
Suppose that we are concerned with $k$-step-ahead prediction where $k$ is a
specified positive integer. Also suppose that $\hat{\Theta}$ is an estimator of
$\theta$ with the same asymptotic distribution as the (conditional) maximum
likelihood estimator. We note that there are many possible choices for
$\hat{\Theta}$. For example, for a stationary Gaussian AR(1) model, commonly-used
estimators of the autoregressive parameter include the least-squares, Yule-Walker
and Burg estimators. We use lower case to denote observed values of random
vectors. For example, $y^{(n)}$ denotes the observed value of the random vector
$Y^{(n)}$. We also use the Einstein summation notation that repeated indices are
implicitly summed over.

Firstly, suppose that our aim is to find an upper prediction limit
$z(Y_1, \ldots, Y_n)$ for $Y_{n+k}$ that has coverage probability conditional on
$Y^{(n)} = y^{(n)}$ equal to $1 - \alpha$, i.e. such that

$$P_\theta\big(Y_{n+k} \le z(Y_1, \ldots, Y_n) \mid Y^{(n)} = y^{(n)}\big) = 1 - \alpha$$

for all $\theta$ and $y^{(n)}$. The desirability of a prediction limit or interval
having coverage probability $1 - \alpha$ conditional on $Y^{(n)} = y^{(n)}$ has
been noted by a number of authors. In the context of an AR($p$) process, this has
been noted by Phillips (1979), Stine (1987), Thombs and Schucany (1990), Kabaila
(1993), McCullough (1994), He (2000), Kabaila and He (2004) and Vidoni (2004). In
the context of an ARCH($p$) process, this has been noted by Christoffersen (1998),
Kabaila (1999), Vidoni (2004) and Kabaila and Syuhada (2008).
Define $z_\alpha(\theta, y^{(n)})$ by the requirement that
$P_\theta\big(Y_{n+k} \le z_\alpha(\theta, y^{(n)}) \mid Y^{(n)} = y^{(n)}\big) = 1 - \alpha$
for all $\theta$ and $y^{(n)}$. The estimative $1 - \alpha$ prediction limit is
defined to be $z_\alpha(\hat{\Theta}, Y^{(n)})$. This prediction limit may not have
adequate coverage probability properties unless $n$ is very large. It may be shown
that the coverage probability of $z_\alpha(\hat{\Theta}, Y^{(n)})$ conditional on
$Y^{(n)} = y^{(n)}$ differs from $1 - \alpha$ by $O(n^{-1})$. Barndorff-Nielsen and
Cox (1994, p. 319) modify the estimative prediction limit (using a procedure
clarified by Vidoni (2004) and described in detail by Kabaila and Syuhada, 2008,
Section 2) to obtain an improved prediction limit with better coverage properties.
This improved limit, denoted $z_\alpha^{+}(\hat{\Theta}, Y^{(n)})$ and described in
Section 3 of the present paper, has coverage probability conditional on
$Y^{(n)} = y^{(n)}$ that differs from $1 - \alpha$ by $O(n^{-3/2})$. The algebraic
manipulations needed to obtain this improved prediction limit are feasible only for
the simplest time series models. To avoid these manipulations, Kabaila and Syuhada
(2008) propose a simulation-based approximation to this improved prediction limit.
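As a concrete illustration of an estimative prediction limit, the following minimal sketch computes $z_\alpha(\theta, y_n)$ for a zero-mean Gaussian AR(1) process, for which $Y_{n+k}$ given $Y_n = y_n$ is normal with mean $\rho^k y_n$ and variance $\sigma^2(1 + \rho^2 + \cdots + \rho^{2(k-1)})$. The function name `ar1_upper_limit` and the numerical estimates are hypothetical and are not taken from the papers cited above.

```python
from statistics import NormalDist


def ar1_upper_limit(rho, sigma, y_n, k=1, alpha=0.05):
    """z_alpha(theta, y_n) for a zero-mean Gaussian AR(1) process.

    Returns the 1 - alpha quantile of Y_{n+k} given Y_n = y_n,
    with theta = (rho, sigma).
    """
    mean = rho ** k * y_n
    # Var(Y_{n+k} | Y_n = y_n) = sigma^2 * (1 + rho^2 + ... + rho^(2(k-1)))
    var = sigma ** 2 * sum(rho ** (2 * j) for j in range(k))
    return mean + var ** 0.5 * NormalDist().inv_cdf(1.0 - alpha)


# Estimative limit: plug estimates (hypothetical values) in place of theta.
rho_hat, sigma_hat = 0.48, 1.05
estimative_limit = ar1_upper_limit(rho_hat, sigma_hat, y_n=1.2, k=2)
print(estimative_limit)
```

The estimative limit simply treats the estimates as if they were the true parameter values, which is why its conditional coverage can fall short of $1 - \alpha$ for moderate $n$.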

In the present paper, we extend these results to prediction intervals as follows.
In Section 4, we show that the estimative $1 - \alpha$ prediction interval has
coverage probability conditional on $Y^{(n)} = y^{(n)}$ that differs from
$1 - \alpha$ by $O(n^{-1})$. In this section, we also present a modification of an
estimative $1 - \alpha$ prediction interval, analogous to the Barndorff-Nielsen and
Cox (1994, p. 319) modification of an estimative prediction limit, to obtain an
improved $1 - \alpha$ prediction interval with better coverage properties. We show
that this improved $1 - \alpha$ prediction interval has coverage probability
conditional on $Y^{(n)} = y^{(n)}$ that differs from $1 - \alpha$ by $O(n^{-3/2})$.
To avoid the extensive algebraic manipulations required to find this improved
prediction interval, we propose a simulation-based approximation to this interval,
analogous to the Kabaila and Syuhada (2008) simulation-based approximation to the
improved prediction limit.

The improved $1 - \alpha$ prediction limit and interval are obtained from the
estimative $1 - \alpha$ prediction limit and interval, respectively, using a
correction that includes the asymptotic bias of $\hat{\Theta}$ conditional on
$Y^{(n)} = y^{(n)}$. Kabaila and Syuhada (2007, Section 4) present an example
showing that this bias can be very sensitive to small changes in the estimator
$\hat{\Theta}$. A further illustration of this fact is provided in Section 2 of the
present paper. It may require considerable effort to find the asymptotic bias of
$\hat{\Theta}$ conditional on $Y^{(n)} = y^{(n)}$. In Sections 3 and 4 we show,
however, that the improved $1 - \alpha$ prediction limit and interval have
asymptotic efficiencies that are functionally independent of this bias. This has
the following two consequences. Firstly, if the improved prediction limit or
interval is obtained algebraically using the Barndorff-Nielsen and Cox (1994,
p. 319) methodology or its analogue, respectively, then the estimative and improved
prediction limits and intervals can conveniently be based on the (conditional)
maximum likelihood estimator. This is because the asymptotic conditional bias of
this estimator can be found using the very convenient formula of Vidoni (2004,
p. 144). Secondly, improved prediction limits and intervals obtained using a
Kabaila and Syuhada (2008) type simulation-based approximation have asymptotic
efficiency that is independent of the estimator $\hat{\Theta}$, on which both the
estimative and improved $1 - \alpha$ prediction limits and intervals are based.
Note that we assume throughout this paper that $\hat{\Theta}$ has the same
asymptotic distribution as the (conditional) maximum likelihood estimator.

2. SENSITIVITY OF THE ASYMPTOTIC CONDITIONAL BIAS TO SOME 
SMALL CHANGES IN THE ESTIMATOR 

Consider a stationary zero-mean Gaussian AR(1) process $\{Y_t\}$ satisfying
$Y_t = \rho Y_{t-1} + \varepsilon_t$, for all integer $t$, where $|\rho| < 1$ and
the $\varepsilon_t$ are independent and identically $N(0, \sigma^2)$ distributed.
Note that $\varepsilon_t$ and $(Y_{t-1}, Y_{t-2}, \ldots)$ are independent for each
$t$. The available data is $Y_1, Y_2, \ldots, Y_n$.
The least-squares estimator
$\hat{\rho} = \sum_{t=2}^{n} Y_t Y_{t-1} \big/ \sum_{t=1}^{n-1} Y_t^2$ is obtained
by maximizing the log likelihood function conditional on $Y_1 = y_1$. The
Yule-Walker estimator
$\hat{\rho}_{YW} = \sum_{t=2}^{n} Y_t Y_{t-1} \big/ \sum_{t=1}^{n} Y_t^2$ differs by
a very small amount from $\hat{\rho}$. However, as proved by Shaman and Stine
(1988), $E(\hat{\rho} - \rho) = -2\rho n^{-1} + \cdots$ and
$E(\hat{\rho}_{YW} - \rho) = -3\rho n^{-1} + \cdots$. This illustrates the great
sensitivity of the asymptotic (unconditional) bias of an estimator of $\rho$ to
some small changes in this estimator.

We illustrate the great sensitivity of the asymptotic bias of an estimator of
$\rho$ conditional on the last observation to some small changes in this estimator
as follows. Define the estimator
$\tilde{\rho} = \sum_{t=2}^{n} Y_t Y_{t-1} \big/ \sum_{t=2}^{n} Y_t^2$, which is
obtained by maximizing the log likelihood function conditional on $Y_n = y_n$.
This log likelihood function is found using the backward representation of the
process: $Y_t = \rho Y_{t+1} + \eta_t$, for all integer $t$, where the $\eta_t$ are
independent and identically $N(0, \sigma^2)$ distributed. Note that $\eta_t$ and
$(Y_{t+1}, Y_{t+2}, \ldots)$ are independent.

The estimators $\hat{\rho}$ and $\tilde{\rho}$ differ by only a small amount. They
have the same asymptotic (unconditional) bias, since
$E(\hat{\rho} - \rho) = -2\rho n^{-1} + \cdots$ and
$E(\tilde{\rho} - \rho) = -2\rho n^{-1} + \cdots$. Yet their asymptotic biases
conditional on $Y_n = y_n$ are quite different. These asymptotic conditional biases
are as follows:

$$E(\hat{\rho} - \rho \mid Y_n = y_n) = \big(y_n^2 (1 - \rho^2)\,\rho\,(\sigma^2)^{-1} - 3\rho\big)\, n^{-1} + \cdots$$

$$E(\tilde{\rho} - \rho \mid Y_n = y_n) = -2\rho\, n^{-1} + \cdots$$

These expressions for asymptotic bias may be obtained using the formula for the 
asymptotic conditional bias of the maximum likelihood estimator described by 
Vidoni (2004, p. 144). 
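The contrast between the two conditional biases can be explored numerically. The following is a minimal Monte Carlo sketch, not part of the original analysis: it fixes $Y_n = y_n$, generates the earlier observations through the backward representation described above, and reports $n$ times the average estimation error of the forward least-squares estimator and of the backward estimator. The function names, sample size, number of replications and seed are all illustrative assumptions.

```python
import random


def simulate_given_last(rho, sigma, n, y_n, rng):
    """One path (Y_1, ..., Y_n) of a stationary Gaussian AR(1) with Y_n = y_n.

    Uses the backward representation Y_t = rho * Y_{t+1} + eta_t.
    """
    y = [0.0] * n
    y[-1] = y_n
    for t in range(n - 2, -1, -1):
        y[t] = rho * y[t + 1] + rng.gauss(0.0, sigma)
    return y


def estimators(y):
    """Return (rho_hat, rho_tilde): forward and backward least-squares estimators."""
    num = sum(a * b for a, b in zip(y[1:], y[:-1]))
    rho_hat = num / sum(v * v for v in y[:-1])    # denominator over t = 1..n-1
    rho_tilde = num / sum(v * v for v in y[1:])   # denominator over t = 2..n
    return rho_hat, rho_tilde


def scaled_conditional_bias(rho, sigma, n, y_n, reps=5000, seed=0):
    """Monte Carlo estimate of n * E(estimator - rho | Y_n = y_n) for both estimators."""
    rng = random.Random(seed)
    total_hat = total_tilde = 0.0
    for _ in range(reps):
        r_hat, r_tilde = estimators(simulate_given_last(rho, sigma, n, y_n, rng))
        total_hat += r_hat
        total_tilde += r_tilde
    return n * (total_hat / reps - rho), n * (total_tilde / reps - rho)


if __name__ == "__main__":
    rho, sigma, n = 0.5, 1.0, 100
    sd_y = sigma / (1.0 - rho ** 2) ** 0.5   # stationary standard deviation
    for y_n in (0.0, 2.0 * sd_y):
        print(y_n, scaled_conditional_bias(rho, sigma, n, y_n))
```

According to the leading terms quoted above, the scaled bias of the forward estimator should move with $y_n^2$ while that of the backward estimator should stay near $-2\rho$; a finite-$n$ simulation can only be expected to match these values approximately.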
3. EFFICIENCY RESULT FOR IMPROVED PREDICTION LIMITS 



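Let $F(\,\cdot\,; \theta, y^{(n)})$ denote the cumulative distribution function of
$Y_{n+k}$, conditional on $Y^{(n)} = y^{(n)}$. Also, let
$f(\,\cdot\,; \theta, y^{(n)})$ denote the probability density function
corresponding to this cumulative distribution function. Assume, as do
Barndorff-Nielsen and Cox (1994) and Vidoni (2004), that

$$E_\theta\big(\hat{\Theta} - \theta \mid Y^{(n)} = y^{(n)}\big) = b(\theta, y^{(n)})\, n^{-1} + \cdots \tag{1}$$

$$E_\theta\big((\hat{\Theta} - \theta)(\hat{\Theta} - \theta)^{T} \mid Y^{(n)} = y^{(n)}\big) = i^{-1}(\theta) + \cdots \tag{2}$$

where $i(\theta)$ denotes the expected information matrix. We assume that every
element of $i^{-1}(\theta)$ is $O(n^{-1})$.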

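Define $H_\alpha(\theta \mid y^{(n)}) = P_\theta\big(Y_{n+k} \le z_\alpha(\hat{\Theta}, Y^{(n)}) \mid Y^{(n)} = y^{(n)}\big)$,
which is the conditional coverage probability of the $1 - \alpha$ estimative
prediction limit. Using the fact that the distribution of $Y_{n+k}$ given
$(Y_1, \ldots, Y_n) = (y_1, \ldots, y_n)$ depends only on $y^{(n)}$, it may be shown
that $H_\alpha(\theta \mid y^{(n)}) = E_\theta\big(F(z_\alpha(\hat{\Theta}, y^{(n)}); \theta, y^{(n)}) \mid Y^{(n)} = y^{(n)}\big)$.
Now define $G_\alpha(\hat{\Theta}; \theta \mid y^{(n)}) = F(z_\alpha(\hat{\Theta}, y^{(n)}); \theta, y^{(n)})$.
Thus $H_\alpha(\theta \mid y^{(n)}) = E_\theta\big(G_\alpha(\hat{\Theta}; \theta \mid y^{(n)}) \mid Y^{(n)} = y^{(n)}\big)$.
We now use the stochastic expansion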

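$$G_\alpha(\hat{\Theta}; \theta \mid y^{(n)}) = G_\alpha(\theta; \theta \mid y^{(n)}) + \frac{\partial G_\alpha(\hat{\theta}; \theta \mid y^{(n)})}{\partial \hat{\theta}_r}\bigg|_{\hat{\theta} = \theta} (\hat{\Theta}_r - \theta_r) + \frac{1}{2}\, \frac{\partial^2 G_\alpha(\hat{\theta}; \theta \mid y^{(n)})}{\partial \hat{\theta}_r\, \partial \hat{\theta}_s}\bigg|_{\hat{\theta} = \theta} (\hat{\Theta}_r - \theta_r)(\hat{\Theta}_s - \theta_s) + \cdots \tag{3}$$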



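By the definition of $z_\alpha(\theta, y^{(n)})$,
$G_\alpha(\theta; \theta \mid y^{(n)}) = 1 - \alpha$. Taking the expectation of (3)
conditional on $Y^{(n)} = y^{(n)}$ and using (1) and (2), we obtain
$H_\alpha(\theta \mid y^{(n)}) = 1 - \alpha + c_\alpha(\theta, y^{(n)})\, n^{-1} + \cdots$
where

$$c_\alpha(\theta, y^{(n)})\, n^{-1} = n^{-1}\, b(\theta, y^{(n)})_r\, \frac{\partial G_\alpha(\hat{\theta}; \theta \mid y^{(n)})}{\partial \hat{\theta}_r}\bigg|_{\hat{\theta} = \theta} + \frac{1}{2}\, i^{rs}\, \frac{\partial^2 G_\alpha(\hat{\theta}; \theta \mid y^{(n)})}{\partial \hat{\theta}_r\, \partial \hat{\theta}_s}\bigg|_{\hat{\theta} = \theta} \tag{4}$$

where $b(\theta, y^{(n)})_r$ denotes the $r$th component of the vector
$b(\theta, y^{(n)})$ and $i^{rs}$ denotes the $(r, s)$th element of the inverse of
the expected information matrix $i(\theta)$. In other words, the conditional
coverage probability of the estimative $1 - \alpha$ upper prediction limit
$z_\alpha(\hat{\Theta}, Y^{(n)})$ is $1 - \alpha + O(n^{-1})$.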
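Define

$$d_\alpha(\theta, y^{(n)}) = -\frac{c_\alpha(\theta, y^{(n)})\, n^{-1}}{f\big(z_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)}. \tag{5}$$

The improved $1 - \alpha$ prediction limit described by Barndorff-Nielsen and Cox
(1994, p. 319) is

$$z_\alpha^{+}(\hat{\Theta}, Y^{(n)}) = z_\alpha(\hat{\Theta}, Y^{(n)}) + d_\alpha(\hat{\Theta}, Y^{(n)}).$$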

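The conditional coverage probability of this improved prediction limit is
$P_\theta\big(Y_{n+k} \le z_\alpha^{+}(\hat{\Theta}, Y^{(n)}) \mid Y^{(n)} = y^{(n)}\big) = E_\theta\big(F(z_\alpha^{+}(\hat{\Theta}, y^{(n)}); \theta, y^{(n)}) \mid Y^{(n)} = y^{(n)}\big)$.
We now use the expansion

$$F\big(z_\alpha^{+}(\hat{\Theta}, y^{(n)}); \theta, y^{(n)}\big) = G_\alpha(\hat{\Theta}; \theta \mid y^{(n)}) + f\big(z_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)\, d_\alpha(\theta, y^{(n)}) + \cdots$$

Thus

$$P_\theta\big(Y_{n+k} \le z_\alpha^{+}(\hat{\Theta}, Y^{(n)}) \mid Y^{(n)} = y^{(n)}\big) = H_\alpha(\theta \mid y^{(n)}) + f\big(z_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)\, d_\alpha(\theta, y^{(n)}) + \cdots = 1 - \alpha + O(n^{-3/2}).$$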

Note that the improved prediction limit $z_\alpha^{+}(\hat{\Theta}, Y^{(n)})$ may be
found algebraically using (4) and (5). When these algebraic manipulations become
too complicated, the method of Kabaila and Syuhada (2008) may be used. For any
given $\theta$, these authors estimate
$P_\theta\big(Y_{n+k} \le z_\alpha(\hat{\Theta}, Y^{(n)}) \mid Y^{(n)} = y^{(n)}\big) - (1 - \alpha)$
by Monte Carlo simulation and use this estimate as an approximation to
$c_\alpha(\theta, y^{(n)})\, n^{-1}$ (which appears in (5)). In Kabaila and Syuhada
(2008), the formula for $r(\omega, y^{(n)})$ should be
$n^{-1} c(\omega, y^{(n)}) / f(z(y^{(n)}, \omega); \omega \mid y^{(n)})$ instead of
$c(\omega, y^{(n)}) / f(z(y^{(n)}, \omega); \omega \mid y^{(n)})$, so that
$d(\omega, y^{(n)}) = n^{-1} c(\omega, y^{(n)})$, to order $n^{-1}$.
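The simulation-based idea can be sketched for one-step-ahead prediction in a zero-mean Gaussian AR(1) process. This is a simplified illustration under stated assumptions and is not the algorithm of Kabaila and Syuhada (2008) in detail: the estimator (least squares for $\rho$, a residual-based estimate of $\sigma$), the function names and the simulation settings are all chosen here for illustration. The coverage error is estimated at a given $\theta = (\rho, \sigma)$ and then divided by the conditional density at the limit, as in the correction term defined just before the improved limit.

```python
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()


def estimate(y):
    """Least-squares estimate of rho and a residual-based estimate of sigma (assumed choice)."""
    num = sum(a * b for a, b in zip(y[1:], y[:-1]))
    rho_hat = num / sum(v * v for v in y[:-1])
    rss = sum((a - rho_hat * b) ** 2 for a, b in zip(y[1:], y[:-1]))
    return rho_hat, (rss / (len(y) - 1)) ** 0.5


def limit(rho, sigma, y_n, alpha):
    """z_alpha(theta, y_n) for one-step-ahead prediction."""
    return rho * y_n + sigma * STD_NORMAL.inv_cdf(1.0 - alpha)


def coverage_error(rho, sigma, n, y_n, alpha, reps=4000, seed=0):
    """Monte Carlo estimate of the conditional coverage of the estimative limit minus (1 - alpha)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(reps):
        # Simulate (Y_1, ..., Y_{n-1}) given Y_n = y_n via the backward representation.
        y = [0.0] * n
        y[-1] = y_n
        for t in range(n - 2, -1, -1):
            y[t] = rho * y[t + 1] + rng.gauss(0.0, sigma)
        rho_hat, sigma_hat = estimate(y)
        z = limit(rho_hat, sigma_hat, y_n, alpha)
        total += STD_NORMAL.cdf((z - rho * y_n) / sigma)  # F(z; theta, y_n)
    return total / reps - (1.0 - alpha)


def improved_limit(rho, sigma, n, y_n, alpha, **sim_kwargs):
    """Limit at theta plus the correction -(estimated coverage error) / density."""
    c_hat = coverage_error(rho, sigma, n, y_n, alpha, **sim_kwargs)
    z = limit(rho, sigma, y_n, alpha)
    density = STD_NORMAL.pdf((z - rho * y_n) / sigma) / sigma  # f(z; theta, y_n)
    return z - c_hat / density
```

In use, `improved_limit` would be evaluated at the observed estimate in place of `(rho, sigma)`, so that a fresh simulation is run for each data set; the estimator enters only through `estimate`, which can be swapped without any new algebra.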



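We measure the asymptotic efficiency of the improved prediction limit
$z_\alpha^{+}(\hat{\Theta}, Y^{(n)})$ by examining the asymptotic expansion of
$E_\theta\big(z_\alpha^{+}(\hat{\Theta}, Y^{(n)}) \mid Y^{(n)} = y^{(n)}\big)$. In
other words, this asymptotic efficiency is a function of $\theta$ and $y^{(n)}$.
This measure of asymptotic efficiency is consistent with the general guidelines
put forward by Kabaila and Syuhada (2007) for comparing the efficiencies of
prediction intervals. Using
$G_\alpha(\hat{\Theta}; \theta \mid y^{(n)}) = F(z_\alpha(\hat{\Theta}, y^{(n)}); \theta, y^{(n)})$,
we find that

$$d_\alpha(\theta, y^{(n)}) = -b(\theta, y^{(n)})_r\, n^{-1}\, \frac{\partial z_\alpha(\theta, y^{(n)})}{\partial \theta_r} - i^{rs} \left( \frac{f'\big(z_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)}{2 f\big(z_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)}\, \frac{\partial z_\alpha(\theta, y^{(n)})}{\partial \theta_r}\, \frac{\partial z_\alpha(\theta, y^{(n)})}{\partial \theta_s} + \frac{1}{2}\, \frac{\partial^2 z_\alpha(\theta, y^{(n)})}{\partial \theta_r\, \partial \theta_s} \right),$$

where $f'$ denotes the derivative of $f$ with respect to its first argument. Now,
$z_\alpha^{+}(\hat{\Theta}, y^{(n)})$ is equal to

$$z_\alpha(\theta, y^{(n)}) + \frac{\partial z_\alpha(\theta, y^{(n)})}{\partial \theta_r} (\hat{\Theta}_r - \theta_r) + \frac{1}{2}\, \frac{\partial^2 z_\alpha(\theta, y^{(n)})}{\partial \theta_r\, \partial \theta_s} (\hat{\Theta}_r - \theta_r)(\hat{\Theta}_s - \theta_s) + d_\alpha(\theta, y^{(n)}) + \cdots$$

Thus $E_\theta\big(z_\alpha^{+}(\hat{\Theta}, Y^{(n)}) \mid Y^{(n)} = y^{(n)}\big)$ is
equal to

$$z_\alpha(\theta, y^{(n)}) - i^{rs}\, \frac{f'\big(z_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)}{2 f\big(z_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)}\, \frac{\partial z_\alpha(\theta, y^{(n)})}{\partial \theta_r}\, \frac{\partial z_\alpha(\theta, y^{(n)})}{\partial \theta_s} + \cdots$$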

We see that the asymptotic conditional bias $b(\theta, y^{(n)})\, n^{-1}$ does not
enter into this expression. This has the following two consequences. Firstly, if
the improved prediction limit is found algebraically using (4) and (5) then we can
use that estimator $\hat{\Theta}$ whose asymptotic conditional bias is easiest to
find. Usually, this will be the (conditional) maximum likelihood estimator, whose
asymptotic conditional bias can be found using the formula of Vidoni (2004,
p. 144). Secondly, if the simulation-based method of Kabaila and Syuhada (2008) is
used then we know that the asymptotic efficiency of the improved $1 - \alpha$
prediction limit is independent of the estimator $\hat{\Theta}$, on which the
estimative $1 - \alpha$ prediction limit is based. Note that we assume throughout
this paper that $\hat{\Theta}$ has the same asymptotic distribution as the
(conditional) maximum likelihood estimator.
4. RESULTS FOR IMPROVED PREDICTION INTERVALS 



Suppose that our aim is to find a prediction interval
$[\ell(Y_1, \ldots, Y_n),\, u(Y_1, \ldots, Y_n)]$ for $Y_{n+k}$ that has coverage
probability conditional on $Y^{(n)} = y^{(n)}$ equal to $1 - \alpha$, i.e. such that

$$P_\theta\big(Y_{n+k} \in [\ell(Y_1, \ldots, Y_n),\, u(Y_1, \ldots, Y_n)] \mid Y^{(n)} = y^{(n)}\big) = 1 - \alpha$$

for all $\theta$ and $y^{(n)}$. As in Section 3, define
$F(\,\cdot\,; \theta, y^{(n)})$ and $f(\,\cdot\,; \theta, y^{(n)})$ to be the
cumulative distribution function and probability density function (respectively)
of $Y_{n+k}$ conditional on $Y^{(n)} = y^{(n)}$. Suppose that
$f(\,\cdot\,; \theta, y^{(n)})$ is a continuous unimodal function for all $y^{(n)}$
and $\theta$.

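Define $\ell_\alpha(\theta, y^{(n)})$ and $u_\alpha(\theta, y^{(n)})$ by the
requirements that

$$F\big(u_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big) - F\big(\ell_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big) = 1 - \alpha$$

and

$$f\big(\ell_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big) = f\big(u_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)$$

for all $\theta$ and $y^{(n)}$. If $\theta$ is known then
$[\ell_\alpha(\theta, y^{(n)}),\, u_\alpha(\theta, y^{(n)})]$ is the shortest
prediction interval for $Y_{n+k}$ having coverage probability $1 - \alpha$
conditional on $Y^{(n)} = y^{(n)}$. We define the estimative $1 - \alpha$
prediction interval to be
$I_\alpha(\hat{\Theta}, Y^{(n)}) = [\ell_\alpha(\hat{\Theta}, Y^{(n)}),\, u_\alpha(\hat{\Theta}, Y^{(n)})]$.

Assume that (1) and (2) hold true.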
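The two requirements that define the endpoints of the shortest interval (coverage $1 - \alpha$ and equal density at the two endpoints) can be solved numerically for any continuous unimodal density. The following minimal sketch does so by bisection on the lower tail probability. A lognormal density is used purely as a skewed stand-in for a predictive density; it, the function names and the tolerance are assumptions made here for illustration.

```python
import math
from statistics import NormalDist

N01 = NormalDist()


def lognormal_quantile(p, mu, s):
    """Quantile function of a lognormal(mu, s) distribution (stand-in predictive law)."""
    return math.exp(mu + s * N01.inv_cdf(p))


def lognormal_pdf(x, mu, s):
    """Density of a lognormal(mu, s) distribution."""
    return N01.pdf((math.log(x) - mu) / s) / (x * s)


def shortest_interval(alpha, quantile, pdf, tol=1e-12):
    """Find (l, u) with F(u) - F(l) = 1 - alpha and f(l) = f(u).

    Bisects on the lower tail probability p, with l = F^{-1}(p) and
    u = F^{-1}(p + 1 - alpha), assuming a continuous unimodal density.
    """
    lo, hi = 0.0, alpha
    while hi - lo > tol:
        p = 0.5 * (lo + hi)
        lower, upper = quantile(p), quantile(p + 1.0 - alpha)
        if pdf(lower) < pdf(upper):
            lo = p   # lower endpoint sits too far into the left tail
        else:
            hi = p
    p = 0.5 * (lo + hi)
    return quantile(p), quantile(p + 1.0 - alpha)


mu, s, alpha = 0.0, 0.5, 0.05
lower, upper = shortest_interval(
    alpha,
    lambda p: lognormal_quantile(p, mu, s),
    lambda x: lognormal_pdf(x, mu, s),
)
print(lower, upper)
```

For a symmetric density the solution is the equal-tailed interval; for a skewed density such as this one the interval shifts towards the mode and is shorter than the equal-tailed interval.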

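Define $H_\alpha(\theta \mid y^{(n)}) = P_\theta\big(Y_{n+k} \in I_\alpha(\hat{\Theta}, Y^{(n)}) \mid Y^{(n)} = y^{(n)}\big)$,
which is the conditional coverage probability of the $1 - \alpha$ estimative
prediction interval. Using the fact that the distribution of $Y_{n+k}$ given
$(Y_1, \ldots, Y_n) = (y_1, \ldots, y_n)$ depends only on $y^{(n)}$, it may be shown
that $H_\alpha(\theta \mid y^{(n)}) = E_\theta\big(G_\alpha(\hat{\Theta}; \theta \mid y^{(n)}) \mid Y^{(n)} = y^{(n)}\big)$,
where we define
$G_\alpha(\hat{\Theta}; \theta \mid y^{(n)}) = F(u_\alpha(\hat{\Theta}, y^{(n)}); \theta, y^{(n)}) - F(\ell_\alpha(\hat{\Theta}, y^{(n)}); \theta, y^{(n)})$.
We now use the expansion (3). By the definition of $\ell_\alpha(\theta, y^{(n)})$
and $u_\alpha(\theta, y^{(n)})$, $G_\alpha(\theta; \theta \mid y^{(n)}) = 1 - \alpha$.
Thus $H_\alpha(\theta \mid y^{(n)}) = 1 - \alpha + c_\alpha(\theta, y^{(n)})\, n^{-1} + \cdots$
where $c_\alpha(\theta, y^{(n)})\, n^{-1}$ is given by (4). In other words, the
conditional coverage probability of the estimative $1 - \alpha$ prediction interval
$I_\alpha(\hat{\Theta}, Y^{(n)})$ is $1 - \alpha + O(n^{-1})$.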

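Suppose that $d^{\ell}_\alpha(\theta, y^{(n)})$ and $d^{u}_\alpha(\theta, y^{(n)})$
are both $O(n^{-1})$ for every $\theta$ and $y^{(n)}$. Also suppose that

$$d^{\ell}_\alpha(\theta, y^{(n)}) + d^{u}_\alpha(\theta, y^{(n)}) = -\frac{c_\alpha(\theta, y^{(n)})\, n^{-1}}{f\big(u_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)}. \tag{6}$$

Note that we could replace $f(u_\alpha(\theta, y^{(n)}); \theta, y^{(n)})$ in the
denominator of the expression on the right-hand side by
$f(\ell_\alpha(\theta, y^{(n)}); \theta, y^{(n)})$, since
$f(\ell_\alpha(\theta, y^{(n)}); \theta, y^{(n)}) = f(u_\alpha(\theta, y^{(n)}); \theta, y^{(n)})$.
The improved $1 - \alpha$ prediction interval is

$$I_\alpha^{+}(\hat{\Theta}, Y^{(n)}) = \big[\ell_\alpha(\hat{\Theta}, Y^{(n)}) - d^{\ell}_\alpha(\hat{\Theta}, Y^{(n)}),\; u_\alpha(\hat{\Theta}, Y^{(n)}) + d^{u}_\alpha(\hat{\Theta}, Y^{(n)})\big].$$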

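The conditional coverage probability of this improved prediction interval is

$$P_\theta\big(Y_{n+k} \in I_\alpha^{+}(\hat{\Theta}, Y^{(n)}) \mid Y^{(n)} = y^{(n)}\big) = E_\theta\Big(F\big(u_\alpha(\hat{\Theta}, y^{(n)}) + d^{u}_\alpha(\hat{\Theta}, y^{(n)}); \theta, y^{(n)}\big) - F\big(\ell_\alpha(\hat{\Theta}, y^{(n)}) - d^{\ell}_\alpha(\hat{\Theta}, y^{(n)}); \theta, y^{(n)}\big) \,\Big|\, Y^{(n)} = y^{(n)}\Big).$$

We now use the stochastic expansion

$$F\big(u_\alpha(\hat{\Theta}, y^{(n)}) + d^{u}_\alpha(\hat{\Theta}, y^{(n)}); \theta, y^{(n)}\big) - F\big(\ell_\alpha(\hat{\Theta}, y^{(n)}) - d^{\ell}_\alpha(\hat{\Theta}, y^{(n)}); \theta, y^{(n)}\big)$$
$$= F\big(u_\alpha(\hat{\Theta}, y^{(n)}); \theta, y^{(n)}\big) - F\big(\ell_\alpha(\hat{\Theta}, y^{(n)}); \theta, y^{(n)}\big) + f\big(u_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)\, d^{u}_\alpha(\theta, y^{(n)}) + f\big(\ell_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)\, d^{\ell}_\alpha(\theta, y^{(n)}) + \cdots$$

Thus

$$P_\theta\big(Y_{n+k} \in I_\alpha^{+}(\hat{\Theta}, Y^{(n)}) \mid Y^{(n)} = y^{(n)}\big) = H_\alpha(\theta \mid y^{(n)}) + f\big(u_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big) \big(d^{\ell}_\alpha(\theta, y^{(n)}) + d^{u}_\alpha(\theta, y^{(n)})\big) + \cdots = 1 - \alpha + O(n^{-3/2}).$$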
Note that the improved prediction interval $I_\alpha^{+}(\hat{\Theta}, Y^{(n)})$ may
be found algebraically using (4) and (6). When these algebraic manipulations
become too complicated, a simulation-based method, similar to that described by
Kabaila and Syuhada (2008) for prediction limits, may be used. For any given
$\theta$, we estimate
$P_\theta\big(Y_{n+k} \in I_\alpha(\hat{\Theta}, Y^{(n)}) \mid Y^{(n)} = y^{(n)}\big) - (1 - \alpha)$
by Monte Carlo simulation and use this estimate as an approximation to
$c_\alpha(\theta, y^{(n)})\, n^{-1}$ (which appears in (6)).

We measure the asymptotic efficiency of the improved prediction interval
$I_\alpha^{+}(\hat{\Theta}, Y^{(n)})$ by examining the asymptotic expansion of
$E_\theta\big(\text{length of } I_\alpha^{+}(\hat{\Theta}, Y^{(n)}) \mid Y^{(n)} = y^{(n)}\big)$.
In other words, this asymptotic efficiency is a function of $\theta$ and $y^{(n)}$.

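Using $G_\alpha(\hat{\Theta}; \theta \mid y^{(n)}) = F(u_\alpha(\hat{\Theta}, y^{(n)}); \theta, y^{(n)}) - F(\ell_\alpha(\hat{\Theta}, y^{(n)}); \theta, y^{(n)})$,
we find that

$$\frac{\partial G_\alpha(\hat{\theta}; \theta \mid y^{(n)})}{\partial \hat{\theta}_r}\bigg|_{\hat{\theta} = \theta} = f\big(u_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)\, \frac{\partial u_\alpha(\theta, y^{(n)})}{\partial \theta_r} - f\big(\ell_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)\, \frac{\partial \ell_\alpha(\theta, y^{(n)})}{\partial \theta_r}$$

and

$$\frac{\partial^2 G_\alpha(\hat{\theta}; \theta \mid y^{(n)})}{\partial \hat{\theta}_r\, \partial \hat{\theta}_s}\bigg|_{\hat{\theta} = \theta} = f'\big(u_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)\, \frac{\partial u_\alpha(\theta, y^{(n)})}{\partial \theta_r}\, \frac{\partial u_\alpha(\theta, y^{(n)})}{\partial \theta_s} + f\big(u_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)\, \frac{\partial^2 u_\alpha(\theta, y^{(n)})}{\partial \theta_r\, \partial \theta_s}$$
$$\qquad - f'\big(\ell_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)\, \frac{\partial \ell_\alpha(\theta, y^{(n)})}{\partial \theta_r}\, \frac{\partial \ell_\alpha(\theta, y^{(n)})}{\partial \theta_s} - f\big(\ell_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)\, \frac{\partial^2 \ell_\alpha(\theta, y^{(n)})}{\partial \theta_r\, \partial \theta_s}.$$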

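We now substitute these expressions into (4) and (6), to obtain an expression for
$d^{\ell}_\alpha(\theta, y^{(n)}) + d^{u}_\alpha(\theta, y^{(n)})$ in terms of
$b(\theta, y^{(n)})_r$ and $i^{rs}$. Now, the length of
$I_\alpha^{+}(\hat{\Theta}, y^{(n)})$ is equal to

$$u_\alpha(\hat{\Theta}, y^{(n)}) - \ell_\alpha(\hat{\Theta}, y^{(n)}) + d^{\ell}_\alpha(\hat{\Theta}, y^{(n)}) + d^{u}_\alpha(\hat{\Theta}, y^{(n)}).$$

Thus $E_\theta\big(\text{length of } I_\alpha^{+}(\hat{\Theta}, Y^{(n)}) \mid Y^{(n)} = y^{(n)}\big)$
is equal to

$$u_\alpha(\theta, y^{(n)}) - \ell_\alpha(\theta, y^{(n)}) - \frac{i^{rs}}{2} \left( \frac{f'\big(u_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)}{f\big(u_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)}\, \frac{\partial u_\alpha(\theta, y^{(n)})}{\partial \theta_r}\, \frac{\partial u_\alpha(\theta, y^{(n)})}{\partial \theta_s} - \frac{f'\big(\ell_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)}{f\big(\ell_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)}\, \frac{\partial \ell_\alpha(\theta, y^{(n)})}{\partial \theta_r}\, \frac{\partial \ell_\alpha(\theta, y^{(n)})}{\partial \theta_s} \right) + \cdots$$
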
We see that the asymptotic conditional bias $b(\theta, y^{(n)})\, n^{-1}$ does not
enter into this expression. This has the following two consequences. Firstly, if
the improved prediction interval is found algebraically using (4) and (6) then we
can use that estimator whose asymptotic conditional bias is easiest to find.
Usually, this will be the (conditional) maximum likelihood estimator, whose
asymptotic conditional bias can be found using the formula of Vidoni (2004,
p. 144). Secondly, if the simulation-based method, similar to that of Kabaila and
Syuhada (2008), is used then we know that the asymptotic efficiency of the improved
$1 - \alpha$ prediction interval is independent of the estimator $\hat{\Theta}$, on
which the estimative $1 - \alpha$ prediction interval is based. Note that we assume
throughout this paper that $\hat{\Theta}$ has the same asymptotic distribution as
the (conditional) maximum likelihood estimator.
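The cancellation of the bias term can be checked directly. The sketch below uses shorthand that is introduced here and is not in the original text: $f_u$, $f'_u$ (and $f_\ell$, $f'_\ell$) denote the conditional density and its derivative evaluated at the upper (lower) endpoint of the interval at the true parameter value, subscripts $r$, $s$ on $u$ and $\ell$ denote partial derivatives with respect to $\theta_r$, $\theta_s$, and $b_r$ is the $r$th component of the bias vector. It relies on $f_\ell = f_u$ and on the conditional moment assumptions (1) and (2).

```latex
\begin{align*}
c_\alpha n^{-1}
  &= n^{-1} b_r f_u\,(u_r - \ell_r)
     + \tfrac12\, i^{rs}\bigl( f'_u\, u_r u_s - f'_\ell\, \ell_r \ell_s
     + f_u\,(u_{rs} - \ell_{rs}) \bigr), \\
d^{\ell}_\alpha + d^{u}_\alpha
  &= -\frac{c_\alpha n^{-1}}{f_u}
   = -n^{-1} b_r\,(u_r - \ell_r)
     - \tfrac12\, i^{rs}\Bigl( \frac{f'_u}{f_u}\, u_r u_s
     - \frac{f'_\ell}{f_\ell}\, \ell_r \ell_s + u_{rs} - \ell_{rs} \Bigr), \\
E_\theta\bigl( u_\alpha(\hat\Theta, y^{(n)}) - \ell_\alpha(\hat\Theta, y^{(n)})
     \bigm| Y^{(n)} = y^{(n)} \bigr)
  &= u_\alpha - \ell_\alpha + n^{-1} b_r\,(u_r - \ell_r)
     + \tfrac12\, i^{rs}\,(u_{rs} - \ell_{rs}) + \cdots
\end{align*}
```

Adding the last two lines, the terms in $b_r$ and the second-derivative terms $u_{rs} - \ell_{rs}$ cancel, which leaves $u_\alpha - \ell_\alpha - \tfrac12 i^{rs}\bigl( (f'_u / f_u)\, u_r u_s - (f'_\ell / f_\ell)\, \ell_r \ell_s \bigr) + \cdots$ as the expected length, with no dependence on the bias.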

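Now consider the particular case that $f(\,\cdot\,; \theta, y^{(n)})$ is also
symmetric about $m(\theta, y^{(n)})$ for all $y^{(n)}$ and $\theta$. In other words,
suppose that, for every $y^{(n)}$ and $\theta$,
$f(m(\theta, y^{(n)}) - w; \theta, y^{(n)}) = f(m(\theta, y^{(n)}) + w; \theta, y^{(n)})$
for all $w > 0$. In this case, we may choose
$d^{\ell}_\alpha(\theta, y^{(n)}) = d^{u}_\alpha(\theta, y^{(n)}) = \delta_\alpha(\theta, y^{(n)})$,
say. Define $w_\alpha(\theta, y^{(n)})$ by the requirement that
$P_\theta\big(Y_{n+k} \in [m(\theta, y^{(n)}) - w_\alpha(\theta, y^{(n)}),\, m(\theta, y^{(n)}) + w_\alpha(\theta, y^{(n)})] \mid Y^{(n)} = y^{(n)}\big) = 1 - \alpha$
for all $\theta$ and $y^{(n)}$. Thus
$\ell_\alpha(\theta, y^{(n)}) = m(\theta, y^{(n)}) - w_\alpha(\theta, y^{(n)})$ and
$u_\alpha(\theta, y^{(n)}) = m(\theta, y^{(n)}) + w_\alpha(\theta, y^{(n)})$. It may
be shown that $\delta_\alpha(\theta, y^{(n)})$ is equal to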

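$$-\frac{c_\alpha(\theta, y^{(n)})\, n^{-1}}{2 f\big(u_\alpha(\theta, y^{(n)}); \theta, y^{(n)}\big)}.$$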



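The improved prediction interval
$[\ell_\alpha(\hat{\Theta}, Y^{(n)}) - \delta_\alpha(\hat{\Theta}, Y^{(n)}),\; u_\alpha(\hat{\Theta}, Y^{(n)}) + \delta_\alpha(\hat{\Theta}, Y^{(n)})]$
has been considered in the context of one-step-ahead prediction for $\{Y_t\}$ a
stationary zero-mean Gaussian AR(1) process by Kabaila and Syuhada (2007, Section
4), where the formula for $d(\theta, y_n)$ should be
$-c(\theta, y_n) / \big(2 v^{-1/2} \phi(z_{1-\alpha/2})\big)$ instead of
$-c(\theta, y_n) / \big(2 \phi(z_{1-\alpha/2})\big)$.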
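For the symmetric case, a simulation-based version of the interval correction can be sketched for one-step-ahead prediction in a zero-mean Gaussian AR(1) process. This is an illustrative sketch under assumptions made here, not the authors' implementation: the estimator, function names and simulation settings are hypothetical, and the half-width adjustment is taken to be minus the estimated coverage error divided by twice the conditional density at the upper endpoint, which for this model is $\sigma^{-1}\phi(z_{1-\alpha/2})$ and so has the same form as the corrected formula quoted above.

```python
import random
from statistics import NormalDist

PHI = NormalDist()


def fit(y):
    """Least-squares estimate of rho and a residual-based estimate of sigma (assumed choice)."""
    num = sum(a * b for a, b in zip(y[1:], y[:-1]))
    rho_hat = num / sum(v * v for v in y[:-1])
    rss = sum((a - rho_hat * b) ** 2 for a, b in zip(y[1:], y[:-1]))
    return rho_hat, (rss / (len(y) - 1)) ** 0.5


def interval_coverage_error(rho, sigma, n, y_n, alpha, reps=4000, seed=1):
    """Monte Carlo estimate of the conditional coverage of the estimative interval minus (1 - alpha)."""
    rng = random.Random(seed)
    q = PHI.inv_cdf(1.0 - alpha / 2.0)
    total = 0.0
    for _ in range(reps):
        # Simulate the past given Y_n = y_n via the backward representation.
        y = [0.0] * n
        y[-1] = y_n
        for t in range(n - 2, -1, -1):
            y[t] = rho * y[t + 1] + rng.gauss(0.0, sigma)
        rho_hat, sigma_hat = fit(y)
        centre = (rho_hat - rho) * y_n / sigma   # standardized centre of the interval
        half = q * sigma_hat / sigma             # standardized half-width
        total += PHI.cdf(centre + half) - PHI.cdf(centre - half)
    return total / reps - (1.0 - alpha)


def improved_interval(rho, sigma, n, y_n, alpha, **sim_kwargs):
    """Return ((lower, upper), delta) with both endpoints moved outward by delta."""
    q = PHI.inv_cdf(1.0 - alpha / 2.0)
    c_hat = interval_coverage_error(rho, sigma, n, y_n, alpha, **sim_kwargs)
    delta = -c_hat / (2.0 * PHI.pdf(q) / sigma)  # density at the upper endpoint is phi(q) / sigma
    lower, upper = rho * y_n - sigma * q, rho * y_n + sigma * q
    return (lower - delta, upper + delta), delta
```

As with the prediction limit, the function would in practice be evaluated at the observed estimate in place of `(rho, sigma)`, and a different estimator can be substituted in `fit` without deriving its conditional bias.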